NEWTON'S METHOD OVER GLOBAL HEIGHT FIELDS 



XANDER FABER AND ADAM TOWSLEY 



Abstract. Newton's method is used to approximate roots of complex valued functions / 
by creating a sequence of points that converges to some root of / in the usual topology. For 
any field K equipped with a set of pairwise inequivalent absolute values satisfying a product 
^ formula, we completely describe the conditions under which Newton's method applied to a 

£SJ \ squarefree polynomial f £ K [x] will succeed in finding some root of / in the v-adic topology 

for infinitely many places v of K . Furthermore, we show that if K is a finite extension of the 
\ rationals or of the rational function field over a finite field, then the Newton approximation 

pLl ■ sequence fails to converge w-adically for a positive density of places v. 

m 

Ft -1 ■ 1. Introduction 

Newton's method is one of the most efficient techniques for locating and approximating 
zeros of complex analytic or p-adic analytic functions in a single variable. Recall how it 
works: if f(x) is an analytic function with a simple root a, then for any x sufficiently close 
to a we have 
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Setting Nf(x) = x — jt^k, one can prove that the orbit (x n ) defined by x n = Nf(x n -x) for 
■ n > 1 converges quadratically to the root a. 

! Now let K be a global height field — that is, a field equipped with a set M# of pairwise 

inequivalent absolute values {| ■ \ v : v G Mk} satisfying an adelic property and a product 
formula. The basic examples are number fields and function fields of transcendence degree 1. 
(See Section 3.1 for more details.) Each v induces a different metric structure on K. Fix 
a univariate polynomial function / and an initial guess Xq, both defined over K. Define a 
•jS \ sequence recursively by x n = Nf(x n -i) for n > 1, where as above we set 



N f (x)=x-f(x)/f'(x). 

We will say that (x n ) is the Newton approximation sequence associated to / and xq if 
it is not eventually periodic. Following an investigation of the adelic convergence of orbits 
of more general dynamical systems [10], Silverman and Voloch asked the following question. 
(We paraphrase. See also Remark 1.6 below.) 

What can be said about the set of places v of K for which the Newton 
approximation sequence (x n ) converges to some root of /? 

Silverman and Voloch showed that the complement of the set of places of convergence is 
infinite. Later, the first author and Voloch showed that the set of places of convergence itself 
is infinite, unless there is a specific kind of dynamical obstruction. Theorem 1.1 summarizes 
these conclusions in the case of squarefree polynomials. Recall that if tp £ K(z) is a rational 
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function of degree at least 2, a point a G P 1 ^) is called an exceptional fixed point for ip if 
<p(cx) = a and tp is totally ramified at a. 

Theorem 1.1 ([7, 10]). Let K be a number field, and let (x n ) be the Newton approximation 
sequence associated to a squarefree polynomial f G K[x] of degree at least 2 and an initial 
point Xo G K. 

(i) The sequence (x n ) converges v-adically to the root a of f for infinitely many places 
v if and only if a is not an exceptional fixed point for the dynamical system Nf . In 
particular, if f is not quadratic, then (x n ) converges v-adically to some root of f for 
infinitely many places v. 

(ii) The sequence (x n ) fails to converge in F l (K v ) for infinitely many places v. 

The first statement says that the arithmetic conclusion about convergence to a root at 
infinitely many places is equivalent to a geometric conclusion about the dynamics of a certain 
fixed point for the Newton map. The statement itself requires only the language of global 
height fields, while its proof uses a Diophantine approximation technique specific to number 
fields. Our first goal in the present paper is to show that Theorem 1.1 (i) is really a statement 
about global height fields by using a result of the second author [11], which in turn is a 
consequence of Runge's method. 

Theorem 1.2. Let K be a global height field, and let (x n ) be the Newton approximation 
sequence associated to a point xq G K and a squarefree polynomial f G K[x] with deg(/) > 2. 
Then the sequence (x n ) converges v-adically to a root a of f for infinitely many places v of 
K if and only if a is not an exceptional fixed point for the dynamical system Nf . 

Remark 1.3. The condition that / be squarefree implies that /' is not identically zero, and 
hence that Nf is well defined. Moreover, if K has characteristic p > and g is a function 
of x p , say g = h(x p ), then Nf. g = Nf, which means the Newton map ignores inseparable 
factors. For purposes of root finding, it is also reasonable to assume that / is squarefree by 
replacing / with // gcd(/, /') if necessary. 

The conclusion in Theorem 1.1 (i) that (x n ) converges for infinitely many places v if 
deg(/) > 2 is a consequence of a classification of polynomials / that have a dynamically 
exceptional root a — i.e., a root that is an exceptional fixed point for Nf. For global height 
fields of characteristic zero, this classification carries over verbatim; with some modification 
of the argument in positive characteristic, one can still give a classification of polynomi- 
als / for which all of its roots are exceptional fixed points for Nf (Proposition 2.9). This 
immediately yields the following corollary. 

Corollary 1.4. Let K & be a fixed algebraic closure of K. With the notation of Theorem 1.2, 
the sequence (x n ) converges v-adically to some root of f for infinitely many places v of K 
unless one of the following is true: 

• f is quadratic; 

• K has characteristic p > and there exist A,B,C G K & and an integer r > 1 such 
that Af(Bx + C) = x pr -x; or 

• K has characteristic p > and there exist X,A,B,C G K & and an integer r > 1 
such that Af(Bx + C) = x pr+1 - (A + l)x pr + Ax. 
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We now discuss quantitative extensions of Theorem l.l(ii). The first author and Voloch 
conjectured that something much stronger should be true in the case of number fields: the 
set of places v of K for which (x n ) does not converge -u-adically to any root of / has density 
one when ordered by norm. They were only able to provide numerical and heuristic evidence 
to this effect. A clever technique using the Chebotarev density theorem was later developed 
by several authors to address a related question on collisions of orbits modulo primes [2]. It 
implies that Newton's method fails to converge to some root for a positive density of places 
of a number field [2, Thm. 4.6]. (It does not give an effective lower bound on the density.) 
We are able to give a unified proof of positive density for any global field. 

Theorem 1.5. Let K be a finite extension of the rational field Q or of the rational function 
field ¥ P (T) . Assume the setup of Theorem 1.2. The set of places v of K for which the Newton 
approximation sequence (x n ) diverges v-adically has positive lower density when ordered by 
norm. 

Remark 1.6. Silverman and Voloch proved a preliminary form of the above theorem for a 
number field K or an arbitrary function field K/k of transcendence degree 1. They showed 
that the set of places v of K for which the Newton approximation sequence diverges is infinite 
[10, Remark 10]. Surprisingly, they were able to show that the result continues to hold in 
the function field setting even when the map is purely inseparable, and we model the purely 
inseparable case of our proof on theirs. 

We now update the main conjecture of [7] to include the case of function fields: 

Conjecture 1.7 (Newton Approximation Fails for 100% of Places). Let K be a finite exten- 
sion of the rational field Q or of the rational function field ¥ p (T). Let (x n ) be the Newton 
approximation sequence associated to a squarefree polynomial f G K[x] of degree at least 2 
and a point x G K . Define C(K, f, x ) to be the set of places v of K for which (x n ) fails to 
converge v-adically to some root of f . Then the set of places C(K, f,Xo) has natural density 
one when ordered by norm. 

For other global height fields K, we expect the set of places of divergence to be "large," but 
there is some issue in clarifying what "large" should mean. We are able to give a satisfying 
answer in the case where K/k is the function field of a curve Y defined over a global height 
field k. Here "large" may be quantified in terms of a height function on Y. 

Theorem 1.8. Let k be a global height field, and let K/k be a function field of transcendence 
degree one. Fix a nonsingular connected curve Y/k such that K = k(Y), and fix a height 
function hy on Yik 3 ') associated to an ample divisor. Let (x n ) be the Newton approximation 
sequence associated to a squarefree polynomial f G K[x] of degree at least 2 and a point 
Xq G K. Then there exists a positive real number M = M(f,XQ,hy) such that (x n ) fails to 
converge in K y for all y G Y(k a ) satisfying hy(y) > M. 

More generally, one can place a fundamental cardinality restriction on the set of places at 
which Newton approximation may detect a root of a polynomial. 

Proposition 1.9. Let K be a global height field, and let (x n ) be the Newton approximation 
sequence associated to a squarefree polynomial f G K[x] of degree at least 2 and a point 
Xo G K. The sequence (x n ) converges v-adically to a root of f for at most a countable 
number of places v of K. 
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The proof is trivial. Pass to a finite extension of K if necessary so that / splits completely. 
For each of the countably many pairs (n, a) G N x Zeros(/), the inequality \x n — a\ v < 1 
may only be satisfied for finitely many places v of K. 

Morally, the above proposition holds because there is a finitely generated subfield of K 
over which / and Xq are defined, and a finitely generated field admits only countably many 
places. Note that, unlike Q or W p (T), the places of an arbitrary finitely generated field do 
not come with any sort of canonical ordering with which to measure density. With these 
observations in mind, we ask the following: 

Question 1.10. Is there a meaningful way to assert that Newton approximation fails for a 
"large" set of places of an arbitrary finitely generated global height field? 

To close the introduction, we describe the contents of this paper. Section 2 describes a 
number of geometric properties of the correspondence / t— > Nf between polynomials and 
Newton maps. This includes the following dynamical characterization: Up to conjugation, 
Newton maps are precisely the rational functions of degree d > 2 that have at least d critical 
fixed points. The results in Section 2 are well known in characteristic zero, so our goal is to 
compare and contrast the behavior of this correspondence in characteristic zero with that in 
positive characteristic. We prove Theorem 1.2 in Section 3, and the proof of Theorems 1.5 
and 1.8 will occupy Section 4. 

Acknowledgments. The authors would like to thank Tom Tucker for recommending that 
they embark on this collaboration, and for suggesting the use of Call/Silverman specialization 
in the proof of Theorem 1.8. They also thank the anonymous referee for the observation 
that led to our formulation of Proposition 1.9, and Asher Auel for useful comments on an 
earlier draft of the manuscript. 

2. Geometry of the Newton map 

Throughout this section, we fix a separably closed field L of characteristic p > 0. We begin 
by describing separability properties of the Newton map Nf in terms of the polynomial /. 
Then we discuss the fixed points of the Newton map and use them to give a dynamical 
characterization of rational functions that are conjugate to a Newton map. In the final 
subsection we characterize those squarefree polynomials / for which every root is a totally 
ramified fixed point of the Newton map. 

2.1. Inseparability. 

Definition 2.1. A rational function tp G L{x) is inseparable if L has positive characteristic 
and (p can be written as p(x) = ip (x p ) for some ip G L(x); <p is said to be separable 
otherwise. By the Hurwitz formula, ip is inseparable if and only if its formal derivative is 
identically zero. We say that <p is purely inseparable if <p (x) = rj (x pr ) with r > and rj 
fractional linear. 

It will be important for our arithmetic questions to know which polynomials / give rise 
to purely inseparable Newton maps Nf. 

Remark 2.2. A nonconstant rational function <p G L(x) is separable (resp. inseparable, 
purely inseparable) if and only if the field extension L{x)/L{p) is separable (resp. insepara- 
ble, purely inseparable) in the sense of classical field theory. 
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Proposition 2.3. Suppose that L has positive characteristic, and let f G L [x] be a squarefree 
polynomial of degree at least 2. 

(i) The Newton Map Nf is inseparable if and only if f (x) = xg (x p ) + h (x p ) for poly- 
nomials g,h G L[x}. 

(ii) The Newton map Nf is purely inseparable if and only if f (x) = xg (x pr ) + h (x pr ) 
for polynomials g, h G L [x] of degree at most 1 and a positive integer r. 



Proof. Since / is squarefree, its derivative cannot vanish identically, and Nf is well defined. 
If / (x) = xg (x p ) + h (x p ), then Nf (x) = — , which is inseparable. 

Conversely, assume that Nf is inseparable. By definition, Nf (x) = x ^ yfer^ ; and the 
numerator and denominator have no factor in common since / is squarefree. It follows that 
/' (x) = g (x p ) for some g G L [x] . Integration yields that / (x) = xg (x p ) + h (x p ) for some h. 
This completes the proof of (i). 

For (ii), an immediate calculation shows that Nf is purely inseparable if / has the given 
form. Conversely, if Nf is purely inseparable, then part (i) shows that f(x) = xg (x pr ) + 
h (x pS ) for some polynomials g, h and some integers r,s > 0. We may assume that r and s 

have r = s and h/g is fractional linear. □ 



are chosen to be maximal. Then Nf(x) = — j — ft- As Nf is purely inseparable, we must 



2.2. Degree and fixed points. Typically, a polynomial of degree d will give rise to a 
Newton map of degree d. However, in positive characteristic, the Newton map will have 
degree d — 1 when d = 1 (mod p). This difference has only a slight effect on the basic 
properties of the correspondence / i— > Nf. 

Proposition 2.4. Let f G L[x] be a squarefree polynomial of degree d > 2. 
(i) The Newton map has degree 



deg(N f ) 



if d ^ 1 in L 
1 if d = 1 in L. 



(ii) Let ctrf be the roots of f . Then the fixed points of the Newton map are 

Fix(N f ) = 




. . , a,i, oo} if d ^ 1 in L 
. . , a^} if d = 1 in L. 

(iii) Each of the roots cti of f is a critical fixed point for Nf. If d ^ 1 in L, then oo 
has fixed point multiplier In particular, if d = or 1 in L, then all of the fixed 
points of Nf are critical. 

Proof. The squarefree hypothesis shows there is no common factor shared between the nu- 
merator and denominator of Nf(x) = (xf — f) //'. 

For part (i), observe that the leading terms of the numerator and denominator of Nf are 
(d — l)x d and dx d ~ l , respectively. We are finished unless d = 1 in L and Nf = 0. But 
then we find that f(x) = xg(x p ) + h(x p ) for some polynomials g, h that are at most linear 
(Proposition 2.3). Computing Nf shows that h — 0, and the squarefree hypothesis on / 
forces g to be constant. As we assumed deg(/) > 2, this is a contradiction. 
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For part (ii), it is clear that the a:, are fixed points of Nf. If d ^ 1 in L, then the 
previous paragraph shows that the numerator of Nf has larger degree than the denominator, 
so infinity is a fixed point. A rational function of degree 5 has 5 + 1 fixed points, so we have 
found them all. 

For (iii), we pick a root a of / and write f(x) = (x — a)g(x) for some polynomial g. Now 

Nf{x) -a = (x-a)- —-— = (x - a) — — . 

f'(x) g(x) + (x- a)g\x) 

Since / is squarefree, we find g(a) ^ 0, and Nf(x) is ramified at a. 
If d 1 in L, we can write 

N f (x) = V- 1 ^^- 

where we are ignoring lower order terms. We compute the fixed point multiplier at oo by a 
change of coordinate and ignoring higher order terms: 

Corollary 2.5. If d = 1 in L, then there is no Newton map of degree d. 

The following result gives a dynamical description of Newton maps. It is certainly well 
known in the case of fields of characteristic zero; see for example [9]. The only difference that 
arises in positive characteristic is that it is possible to have rational functions all of whose 
fixed points are critical. Such a function is necessarily conjugate to a Newton map. 

Proposition 2.6. Fix an integer d > 2, and let p> e L(x) be a rational function of degree d. 
The following are equivalent: 

(i) <p is conjugate to a Newton map Nf for some squarefree polynomial f G L[x\; 

(ii) tp has at least d critical fixed points; and 

(iii) If d 7^ in L (resp. d = in L), then tp has exactly d (resp. exactly d+1) critical 
fixed points. 

Proof, (i) (ii). Evident from Proposition 2.4. 

(ii) =^ (i). Suppose that </? has at least d critical fixed points, and write ip = A/B with A 
and B coprime polynomials. If one of the fixed points of p> is not critical, we may conjugate 
it to oo without loss of generality. Otherwise, let us conjugate any of the critical fixed points 
to oo. In particular, we may assume that deg(£>) < deg(A). 

Set f[x) = xB(x) — A(x); the roots of / are exactly the finite fixed points of ip. We claim 
that Nf = tp. First, we show that A' = xB' . The finite critical points of ip are the roots of 

BA' - AB' = B(A' - xB') + B'(xB - A). 

Since every finite fixed point is also a critical point, xB — A \ B(A' — xB'). As A and 
B are coprime, we conclude that xB — A | A' — xB'. But now deg(xB — A) = d, while 
deg(A' — xB') < d — 1. Hence, A' = xB', as desired. This relation implies that /' = 
xB' — A' + B = B, and 

gcd(/, /') = gcd(x5 - A, B) = gcd(A B) = l. 

That is, / is squarefree. A direct computation shows that Nf = A/B, as desired. 

(iii) =^> (ii). Evident. 
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(ii) =>- (iii). Write Ao, • • • , A^ for the multipliers at the d + 1 (distinct) fixed points of <p, 
and let us suppose that Ai = ■ ■ • = A^ = 0. The fixed point index formula shows that 

^l-Xt 1 - A 

We see that d = in L if and only if Ao = 0. □ 

2.3. Dynamically exceptional roots. 

Definition 2.7. A root a of f will be called dynamically exceptional if it is exceptional 
for the Newton map; i.e., its backward orbit {7 G P 1 (L) : NJ(j) = a some n > 0} is finite. 
Equivalently, since roots of / are fixed by Nf, a is dynamically exceptional if and only if 
it is totally ramified for Nf. It follows from the Hurwitz formula that if Nf is not purely 
inseparable, then it has at most two exceptional points. 

The following lemma will allow us to move the roots of / and adjust it to be monic without 
affecting the dynamical properties of the Newton map. 

Lemma 2.8 ([7, Prop. 2.4]). Let f,g G L[x] be polynomials of degree d related by the 
formula g(x) = Af(Bx + C), where A, B,C G L and AB 7^ 0. Set a{x) = Bx + C . Then 
N g = cr" 1 oNfOa. 

Proposition 2.9. Let f G L[x] be a squarefree polynomial of degree d > 2. 

(i) If d = 2, then every root of f is dynamically exceptional. 

(ii) If d > 2 and d 7^ 1 in L, then every root of f is dynamically exceptional if and only 
if L has positive characteristic and there exist A,B,C G L and an integer r > 1 
such that Af(Bx + C) = x pr - x . 

(iii) If d > 2 and d = 1 in L, then every root of f is dynamically exceptional if and only 
if L has positive characteristic and there exist A, A, B,C G L and an integer r > 1 
such that Af(Bx + C) = x pr+1 - (A + l)x pr + Ax. 

Proof. Suppose first that / is quadratic, and that a, (3 are the roots of /. Then both a and f3 
are critical fixed points for Nf. A critical fixed point for a quadratic map is totally ramified, 
so that both a and (3 are dynamically exceptional for /. 

Now assume that d = deg(/) > 2. It was shown in [7, Cor. 1.2] that when L has 
characteristic zero, a polynomial has at most one dynamically exceptional root. For the 
remainder of the proof, we may therefore assume that L has characteristic p > 0. 

Write D for the degree of Nf. Suppose that a, (3 are distinct roots of / that are totally 
ramified fixed points for Nf. By applying Lemma 2.8, we may assume without loss that / 
is monic and that a = and (3 = 1. By conjugating (3 to 00 while fixing 0, we see that 
is conjugate to <p(x) = x D . At least one other fixed point of Nf is totally ramified, so the 
same must be true of (p. The fixed points of ip are 0, 00, and the {D — l) st roots of unity. If 
( is such a root of unity, we find that 

V (x)-( = x D -C = (x-() D 

if and only if D = q is a power of the characteristic and £ G ¥ g . For a G L \ {0}, write 
o~(x) = ax/(x — 1) for a fractional linear transformation that fixes and moves 1 to 00. 
There exists a G L such that 

n q-l T q n <i-^ T q 

N f{x) = a~ x o ^ o o{x) = -. — = -. 2.1 

1 w v ' cfl~ x xi - (x - l)i ai- l xi - xi + 1 
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If d 7^ 1 in L, then Nf fixes infinity, which means a 9 " 1 = 1. Thus Nf(x) = x q . Propo- 
sition 2.3 shows that f(x) = xg(x q ) + h(x q ) for polynomials g, h that are at most linear; 
computing Nf directly shows that g — — 1 and h = x. (Recall that we have assumed / is 
monic.) 

If instead d = 1 in L, then Nf does not fix infinity, so that a^ 1 ^ 1. Replacing a 9-1 with 
1 + 1/A in (2.1) shows that 

NAx) = <*±i£. 

1 K ' x q + A 

As in the previous case, f(x) = xg(x q ) + h(x q ) for some linear polynomials g, h; computing 
Nf directly shows that g = x + A and h = —(A + l)x. 

Conversely, if / is of either of the two special forms in the statement, then Nf is purely 
inseparable (Proposition 2.3), which implies that all of its fixed points are totally ramified. 

□ 



3. Success of Newton's method over global height fields 

3.1. Global height fields. We follow the notation and conventions in [3, §1]. 

Definition 3.1. A field A will be called a global height field if it is equipped with a 
collection Mk of inequivalent nontrivial absolute values | ■ |„ such that the following two 
conditions hold for each a G K \ {0}: 

• (Adelic Property) The set {v G M K : |a|^ ^ 1} is finite, and 

• (Product Formula) \a\ v = 1. 

veM K 

Examples are given by number fields and function fields k(X), where A is a geometrically 
irreducible normal projective variety defined over a field k. 1 See also [5] and [8, §2] for further 
interesting examples. Every finite extension K' /K may be endowed with the structure of a 
global height field where each absolute value on K' restricts to one in Mk, up to equivalence. 
We write Mk> for the corresponding set of absolute values on K' normalized as in [3, §1.3.6, 
1.3.12]. (The normalization is unimportant for the results presented here.) 

Let A' be a global height field with a fixed algebraic closure A a . The absolute logarithmic 
height function h : P x (A' a ) — > IR> is defined as follows. Let a = (a : «i) G P^A 21 ), and let 
K'/K be a finite extension containing the coordinates of a. Then we define 

h{a) = ^2 log m ax{|a U \ai\ w }. 

w€M K , 

By the product formula, the sum is independent of the choice of homogeneous coordinates for 
a, and the normalizations of the elements of Mk> are arranged so that h(a) is independent 
of the choice of extension K'/K containing a. 

3.2. Arithmetically exceptional roots. Throughout this section, A will denote a fixed 
global height field. 



Fix a nontrivial numerically effective divisor H on X. Each prime divisor Y on X gives rise to an absolute 
value | • \y defined by |/|y = exp(— nyordy (/)) for / e k(X), where ny = Y ■ Jfdimpf)-i_ 
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Definition 3.2. Let ( Newton approximation sequence associated to some squarefree 

polynomial / G K[x] and some xq G K. (Recall that we assume (x n ) is not eventually 
periodic.) A root a G K a of / will be called arithmetically exceptional for / if the 

Newton approximation sequence (x n ) converges to a in K v for only finitely many places 
v G M K . 

Definitions 2.7 and 3.2 allow us to restate Theorem 1.2 as an equivalence between arith- 
metic and dynamical exceptionality: 

Theorem 3.3. Let (x n ) be a Newton approximation sequence associated to some squarefree 
polynomial f G K[x] and a point xq G K. Then a root of f is arithmetically exceptional if 
and only if it is dynamically exceptional. 

The proof is a consequence of the following result of the second author. Note that although 
it was originally stated for function fields over a finite constant field, its proof uses only the 
properties of a global height field. 

Theorem 3.4 ([11, Cor. 3.3]). Let if G K{x) be a rational function of degree at least 2. Let 
/3, 7 G P^iT). Suppose that 7 is periodic and not exceptional for ip. If ft is not preperiodic, 
then there exist infinitely many places v of K such that \(p n (f3) — j\ v < 1 for some n > 0. 

Proof of Theorems 1.2 and 3.3. By passing to a finite extension if necessary, we may assume 
that all of the roots of / lie in K. Let a G K be such a root. Write / as / (x) = (x — a) g(x) 
for some polynomial g G K[x] that does not vanish at a. Define S a to be the finite set of 
places v G M K such that at least one of the following is true: 

• The leading coefficient of /' has w-adic absolute value different from 1; 

• | a — (3\ v 7^ 1 for each root (3 of /'; 

• < 1 for each root (3 of /'; and 

• \di\v < 1- 

Let v G Mk \ S a , and suppose that n > is such that \x n — a\ v < 1. By definition, 



x n+ \ — a = N (x n ) — a = x n — a 



[x n - a)g(x n ) 



, x2 9 \ x r. 

[x n - a) 



[x n - a) g' (x n ) + g (x r , 
j 



f'M 

As v ^ S a , the denominator of this last expression has w-adic absolute value 1. Hence, 
\x n+ i — a\ v < \x n — a\l- By induction, \x n+ £ — a\ v < \x n — a\l for every £ > 0. It follows 
that the Newton approximation sequence (x n ) converges to a in the v-adic topology for 
v G Mk \ iS'q, if and only if x n lies in the strict open disk of radius 1 about a for some n > 0. 

Now suppose that a is not dynamically exceptional for /. Theorem 3.4 may be applied 
in our setting with (3 = xq and 7 = a to show there are infinitely many places v for which 
\x n — a\ v < 1 for some n. So a is not arithmetically exceptional by the preceding paragraph. 

Finally, suppose that a is dynamically exceptional. Then 

, > (x- a) de sW 

N f (x)-a = c± jjL- 



for some nonzero c G K. Setting x = x n , we see that 

z„ +1 -a = c ; , w . (3.1) 

Let be 5 a along with the finitely many other v for which |c|„ 7^ 1. Take t> G' S^, and 
suppose that \x n +i — a\ v < 1 for some n > 0. Then either \f'(x n )\ v > 1 or \x n — a\ v < 1. 
If the former is true, then our hypothesis on v shows that \x n — 0\ v > 1 for some root 

of /', in which case \x n \ v > 1. But (3.1) shows that |x n+ i — a|„ = |x n |^ eg( ' 7V/ ' ) dcg ^ ^ > 1, a 
contradiction. So we are forced to conclude that \x n — <y\ v < 1 whenever |x n _|_i — ot\ v <C 1. 
By induction, |x n — a|„ < 1 if and only if |x — < 1. As (x n ) is not eventually periodic, 
there are only finitely many places v for which this last inequality can hold. Hence, a is 
arithmetically exceptional. □ 

4. Failure of Newton's method 

4.1. Global fields. Throughout this subsection, we let K be a global field — that is, a 
finite extension of the field of rational numbers Q or of the function field F P (T) for some 
rational prime p. Let K a be a fixed algebraic closure of K. We write Ok for the integral 
closure of Z or ¥ P [T] inside K. In the function field setting, this is equivalent to saying that 
K is the function field of a smooth connected projective curve C/¥ p endowed with a finite 
morphism C — > , and Ok is the ring of rational functions on C that are regular away from 
the points lying above 00 G P^. 

Write M K for the set of finite places of K ; each place v G M K may be canonically identified 
with a unique prime ideal p v G 0^, and conversely. We will interchange v and p v as needed 
without further comment. The norm of v G M K , denoted Nv, is the order of the residue 
field associated to p v : Nv = #0x/pv The (natural) lower density of a set of places S C M K 
is defined to be 

8(S) = liminf — ^ tth — tt— — ttt- 

- v ; x^oo #{ v G M K : Nv < X} 

Remark 4.1. Only finitely many places of K are ignored by focusing on M K , so this omission 
has no effect on any of the results in this section. 

The proof of Theorem 1.5 will follow the general strategy of [7], but in place of the primitive 
prime divisor result from [6], we use a result of Benedetto et al. for number fields [2] and 
the corresponding result of the second author for function fields [11]. For the statement, we 
recall that if K is a function field of characteristic p and tp G K(x) is an inseparable rational 
function, then it may be written as (fix) = ip{x pr ) for some r > 1 and some separable 
rational function ip. The degree of ip is called the separable degree of tp. Equivalently, the 
separable degree of ip is the number of i^ a -rational solutions to the equation <p(x) = (3 for a 
generic choice of /3. To say that the separable degree of tp is greater than 1 is to say that tp 
does not induce a bijection on F l (K a ). 

Theorem 4.2 ([2, Thm. 1.1], [11, Thm. 4]). Let K be a global field, and let tp G K(x) be 
a rational function of degree at least 2. If K is a function field, suppose further that tp has 
separable degree at least 2. Let 7 G ¥ l (K) be a point with infinite tp- orbit, and let B C F l {K) 
be a finite set of preperiodic points for tp . Then the set of places V C M K such that tp n ( , ~f) ^ 
(mod p) for any n > 0, any G B, and any p G V , has positive lower density. 
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Proof of Theorem 1.5. We wish to show that the set of places v of the global field K at which 
(x n ) fails to converge v-adically to a root of / has positive lower density. Let us suppose 
first that Nf is not purely inseparable. Let B denote the set of fixed points of Nf. (See 
Proposition 2.4.) A continuity argument shows that if (x n ) converges in P 1 (F P ) for a prime 
p G M K , then it converges to an element of B. Applying Theorem 4.2 with 7 = xq shows 
that the sequence (x n ) fails to t>-adically approach B for a set of places v of K with positive 
lower density. 

Now we suppose that Nf(x) is purely inseparable; hence, K necessarily has positive char- 
acteristic p. After conjugating if necessary, we may replace Nf with x q for q some power 
of p. It suffices to show that (x n ) converges v-adically to an element of P 1 (F g ) for at most 
finitely many places v. Let a G ¥ q . Write yo = xq — a. A suitable induction hypothesis 
shows that x n = a + yl . Then for any place v, we see that \x n — a\ v = \yo\l , so that (x n ) 
converges w-adically to a only for those places v for which \yo\ v < 1. The set of such places 
v is finite. A similar argument applies in the case a = 00 if we set y = 1/xq. □ 

4.2. Function fields over global height fields. Before beginning the proof of Theo- 
rem 1.8, we recall that if k is a global height field with height function hk, and if <p G k(x) is a 
rational function of degree at least 2, there is a canonical height function hk^ '■ P 1 (fc a ) — > M>o 
characterized by the following two properties: 

• h k)tp ((p(a)) = deg(<p) ■ h k>ip {a) for any a G P 1 ^ 21 ). 

• The difference \hk^{a) — hk{ot)\ for a G F 1 (k a ) may be bounded by a constant that 
does not depend on a. 

Recall also that if Kj k is a function field as in the statement of Theorem 1.8, we say that a 
rational function (p G K(x) is isotrivial if there is a G PGL2(i^ a ) such that cro^ocr" 1 G k a (x). 
That is, up to a coordinate change and a finite extension of K, the function if is defined over 
the constant field. We prove the following fact about canonical heights for isotrivial maps: 

Lemma 4.3. Let K/k be a function field over a global height field k, let tp G K (x) be an 
isotrival rational function, and let xq G K. Suppose that the isotriviality off is witnessed by 
a G PGL 2 (K a ); that is, a o ip o a v G k(x). The canonical height hx^ (a?o) ~~ >* 00 as n ~ * 00 
unless xq is preperiodic or a(xo) G P x (A; a ). 

Proof. If xq is preperiodic, then fiK,ip (xo) = 0, so we assume that xq is not preperiodic. Let 
:= a o ip o a" -1 G k (x) and define £ n = a (x n ). Observe that = V'(^n) f° r every n > 0. 
If £0 G /c a , then hj((Cn) — for all n. This implies hx^i^n) — for all n. Since canonical 
heights are conjugation invariant, we conclude that h,K, v {x n ) = for all n. 

Finally, we assume that £0 is not defined over (an extension of) the constant field. There 
necessarily exists a place z of K such that |£o|.z > 1- Write ip = g/h for some coprime 
polynomials g, h G k[x]. The coefficients of g and h all have z-adic absolute value 1; 
hence, max{\g(£ )\ z , \h(£ )\ z } = |£ |*, where d = deg(^) = max{deg(£f), deg(h)}. Since 
= g(£n-i) / h(£n-i) f° r a h n > 1, we can proceed by induction to show that 

max{|^n_i)UKa-i)| 2 } = l£o|f. 

It follows that 

h K (Q = l °& maX {\9(tn~l)\y, \h(C n -l)\y} > log \£ \f ■ 

y€M K 
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Hence, hxiCn) oo as n -> oo. Since the canonical height hx,^ differs from the usual height 
Hk by a bounded amount, hK,^{^ n ) oo, and conjugation invariance of canonical heights 
shows that h,K, v (x n ) — > oo as well. □ 

Using this lemma, a theorem of Baker, and the specialization result of Call and Silverman, 
we now prove Theorem 1.8. 

Proof of Theorem 1.8. Suppose first that the Newton map Nf is not isotrivial. Since Xq is not 
preperiodic for Nf, we apply Baker's theorem [1, Cor. 1.8] to conclude that the canonical 
height h,K,N f ( x o) is positive. Write x n>y for the specialization of x n at y G F(/c a ), and 
similarly write Nf tV for the induced rational function with coefficients in k a . Write deg(F) 
for the degree of the ample divisor associated to hy. Then Call/Silverman specialization [4, 
Thm. 4.1] shows that 

h k ,N Ly (x ,y) ~ tlK,N f (xo) ■ ^g|y) aS h Y(v) ~^ 00 ' 

In particular, if hy(y) is sufficiently large, then hk,N f v ( x o,y) > 0. It follows that xq, v is not 
preperiodic for the specialized map Nf >y when y has large height, and in particular this means 
that it cannot ever encounter a fixed point of the map Nf jV . This is sufficient to prove that 
(x n ) does not converge y-adically to a root of /. 

Now suppose that Nf is isotrivial, and choose a G PGL 2 (K a ) such that ip '■= o-otpoa^ 1 G 
k a (x). Set £ n = a{x n ). Note that £ n , = for each n > 1. If £ ^ k a then we apply 

Lemma 4.3 to conclude that hx^ (£ n ) > f° r & H n ^ n o- ^ n o > 0, then replace Xo with x no . 
The strategy in the previous paragraph applies again to show that (x n ) does not converge 
y-adically to a root of / as long as hy(y) is sufficiently large. 

Finally, suppose that Nf is isotrivial as in the last paragraph, and that £o = cr(^o) £ k a . 
Without loss of generality, we will assume it lies in F l (k). The entire sequence (£ n ) must 
also be defined over k. But (£ n ) is therefore discrete in F l (K y ) for every place y, and so it 
cannot converge unless it is eventually constant. This contradicts the hypothesis that (x n ) 
is not eventually periodic. We conclude that Newton approximation fails to converge for all 
places y in this case. □ 

References 

[1] Matthew Baker. A finiteness theorem for canonical heights attached to rational maps over function 

fields. J. Reine Angew. Math., 626:205-233, 2009. 
[2] Robert Benedetto, Dragos Ghioca, Benjamin Hutz, Par Kurlberg, Thomas Scanlon, and Thomas Tucker. 

Periods of rational maps modulo primes. Mathematische Annalen, pages 1-24, 2012. doi:10.1007/s00208- 

012-0799-8. 

[3] Enrico Bombieri and Walter Gubler. Heights in Diophantine geometry, volume 4 of New Mathematical 

Monographs. Cambridge University Press, Cambridge, 2006. 
[4] Gregory S. Call and Joseph H. Silverman. Canonical heights on varieties with morphisms. Compositio 

Math., 89(2):163-205, 1993. 
[5] Pietro Corvaja and Umbcrto Zannier. Arithmetic on infinite extensions of function fields. Boll. Un. Mat. 

Ital. B (7), 11(4):1021-1038, 1997. 
[6] Xander Faber and Andrew Granville. Prime factors of dynamical sequences. J. Reine Angew. Math., 

661:189-214, 2011. 

[7] Xander Faber and Jose Felipe Voloch. On the number of places of convergence for Newton's method 
over number fields. J. Theor. Nombres Bordeaux, 23(2):387-401, 2011. 

12 



[8] Serge Lang. Diophantine geometry. Interscience Tracts in Pure and Applied Mathematics, No. 11. In- 
terscience Publishers (a division of John Wiley & Sons), New York-London, 1962. 

[9] Kiyoko Nishizawa and Masayo Fujimura. Families of rational maps and convergence basins of Newton's 
method. Proc. Japan Acad. Ser. A Math. Scl, 68(6):143-147, 1992. 
[10] Joseph H. Silverman and Jose Felipe Voloch. A local-global criterion for dynamics on P 1 . Acta Arith., 
137(3):285-294, 2009. 

[11] Adam Towslcy. A Hassc principle for periodic points. arXiv: 1209.2399 [math. NT], 2012. 

Department of Mathematics, University of Hawaii, Honolulu, HI 
E-mail address: xander@math.hawaii.edu 

Department of Mathematics, CUNY Graduate Center, New York, NY 
E-mail address: atowsley@gc.cuny.edu 



13 



